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Title; Recombinant Plant Enzyme 
Field of the Invention 

This invention relates to the novel nucleotide sequence of 
a gene encoding a plant enzyme, functional equivalents 
thereof , vectors containing the novel nucleotide sequence, 
a method of producing the enzyme, a method of altering the 
characteristics of a plant and plants having been so 
altered . 

Background of the Invention 

Many plants and plant-derived materials (eg sunflowers, 
oil seed rape) are of commercial significance because they 
are a source of valuable vegetable oils. These oils (or 
lipids) consist of fatty acid residues (relatively long 
hydrocarbon chains) joined by ester links to propan -1, 2, 
3 - triol (otherwise known as glycerol). 

In some higher plants, de novo fatty acid biosynthesis is 
known to be catalyzed by two enzymes, acetyl-CoA 
carboxylase and fatty acid synthetase. The end product of 
de novo fatty acid biosynthesis is palmitoyl-ACP (Acyl 
Carrier Protein) which is then rapidly elongated to 
stearoyl-ACP, A highly specific stearoyl-ACP desaturase 
ensures that the only unsaturated fatty acid in high 
abundance is oleoyl-ACP. Another enzyme, acyl-ACP 
thioesterase is thought to play an important role in the 
chain termination of fatty acid biosynthesis by catalysing 
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the hydrolysis of acyl-ACP to free fatty acid and ACP. It 
is believed that subsequently the free fatty acid is 
incorporated into complex lipids. 

Acyl-ACP thioesterases may be characterised by their 
preferred substrates. Thus acyl-ACP thioesterases showing 
preferential enzymatic activity for long chain fatty acids 
(eg C18 compounds) may be termed "long chain 
thioesterases" . 

Some long chain acyl-ACP thioesterases from plants have 
been identified and purified. These include the enzymes 
from Avocado and Saf flower (Ohlrogge et al., [1978] 
Archives of Biochemistry and Biophysics 189 , 382-391 and 
McKeon & Stumpf, [1982] Journal of Biological Chemistry 
257 , 12,141-12,147, respectively). Bowever, no-one has 
been able to obtain the DNA sequence encoding such a long 
chain thioesterase. Such information would allow the 
application of modern recombinant DNA techniques to 
manipulate the enzyme characteristics (eg level of 
expression), and hence the characteristics of transgenic 
plants. 

Summary of the invention 

In one aspect the invention provides a nucleotide 
sequence, encoding an enzyme precursor having an acyl-ACP 
thioesterase activity, comprising the sequence of 
nucleotides 169-1269 shown in Figure 1 (Seq. ID No, 1) or 
functional equivalents thereof. 

As will be apparent to those skilled in the art, 
functional equivalents of the nucleotide sequence of the 
invention include, for example: those nucleotide sequences 
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which encode the same polypeptide with the same activity 
(i.e. which cleave acyl-ACP molecules ) but which, by 
virtue of the degeneracy of the genetic code, possess a 
different nucleotide sequence; sequences which encode 
substantially the same polypeptide but wherein there may 
be one or more conserved amino acid substitutions (i.e. 
the substitution of an amino acid for one with similar 
properties); sequences which encode substantially the same 
polypeptide (which preferably share at least 50% amino 
acid homology and more preferably at least 60% homology) 
but wherein there may be one or more minor deletions or 
truncations; and sequences which hybridize under standard 
conditions to the complement of nucleotides 169-1269. 
Typically such functional equivalents will have at least 
75% nucleotide sequence homology and preferably at least 
85% homology. An example of a functional equivalent is 
the sequence pNL3 (Seq. ID No. 3) shown in Figure 2. 

A particular example of a functional equivalent is the 
sequence comprising the antisense equivalent to the 
sequence of the invention. Whilst antisense sequences are 
not generally understood to be functional equivalents, use 
of the term functional equivalent is intended for the 
purposes of the present application to encompass such 
sequences. 

Another particular example of functional equivalents are 
those sequences which code for the mature enzyme rather 
than the enzyme precursor. Studies described below show 
that the precursor includes an N-terminal transit peptide 
of about 4kDa. Thus the N-terminal 27-37 amino acids 
(preferably the N terminal 29-35 amino acids) could be 
omitted. Alternatively, the transit peptide could be 
exchanged for the transit peptide of another protein (e.g. 
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that of ACP ) . 

Preferably the sequence also comprises a suitable 5' 
untranslated region, including a promoter, to enable 
expression in appropriate host cells • 

Preferably the sequence also comprises a suitable 3' 
untranslated region. As well as a stop codon, this 3 r 
untranslated region can comprise other signals, such as a 
polyadenylation signal. 

In another aspect, the invention provides an enzyme 
precursor having an acyl-ACP thioesterase activity, 
comprising the amino acid sequence shown in Figure 1 (Seq. 
ID No* 1) or functional equivalents thereof. 

As described previously, functional equivalents can 
include mature enzymes lacking the N terminal transit 
peptide (which is thought to be cleaved between Ala33 and 
Val 34, or in the vicinity thereof). Other functional 
equivalents include chimaeric polypeptides comprising the 
transit peptides of other proteins (such as* that from 
ACP ) . 

An example of such an enzyme precursor is the acyl-ACP 
thioesterase from Brass ica napus (oil seed rape), the gene 
for which has been cloned and sequenced by the inventors. 
The sequences of the full length cDNA clones obtained, 
pNL2 and pNL3 , are shown in* Figures 1 and 2. A number of 
sequence variations can be seen (detailed in Figure 3). 

Some of these nucleotide sequence differences result in 
different deduced amino acid sequences, as shown in Figure 
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It is thought that all these sequence variations are 
manifestations of the same phenomenon: it is believed that 
the B. napus acyl-ACP thioesterase enzyme is encoded by a 
multi-gene family (i.e. there are a number of acyl-ACP 
thioesterase allelic genes in B. napus ). This has already 
been demonstrated for the B. napus ACP gene (Safford et 
al., (1988), European Journal of Biochemistry 174 , 287- 
295). There is also evidence to suggest that there are 
several (10-20) copies of the thioesterase gene in the B. 
napus genome (see Figure 6). 

Thus in a specific embodiment the invention provides a 
nucleotide sequence encoding an enzyme precursor having 
acyl-ACP thioesterase activity and comprising the amino 
acid sequence shown in Figure 1, or functional equivalents 
thereof. 

In another aspect, the invention provides a vector 
containing a nucleotide sequence in accordance with the 
invention. 

A further aspect of the invention comprises a cell 
transformed with the vector defined above and thus capable 
of expressing an enzyme precursor with acyl-ACP 
thioesterase activity. 

Such a transformed cell may be of bacterial, fungal, plant 
or animal origin. 

Such transformed cells can be grown in cultures using 
methods well known to those skilled in the art. Thus, in 
the case of a transformed plant cell r one may obtain a 
transgenic plant. 
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In yet another aspect, the invention provides a method of 
producing an enzyme precursor having acyl-ACP thioesterase 
activity comprising the steps of: inserting a sequence 
encoding the enzyme precursor or a functional equivalent 
into a suitable expression vector; transforming a suitable 
host cell with said vector; growing said transformed host 
cell in suitable culture conditions; and obtaining the 
enzyme precursor from the host cells and/or from the 
culture medium. 

Preferably, a sequence is employed which encodes a 
functional equivalent so as to produce the mature enzyme. 

In view of the disclosures made herein, suitable vectors, 
host cells and culture conditions will be apparent to 
those skilled in the art. 

It may be advantageous to obtain the enzyme precursor free 
from contamination by other plant proteins or products, 
thus in a preferred embodiment the enzyme precursor is 
expressed in an animal or bacterial cell. Most preferably 
a microorganism is used to express the enzyme precursor. 
Conveniently the precursor is secreted. 

The invention has a number of applications. It might 
prove possible, by under- or over-expressing this enzyme 
precursor (or mature enzyme), to alter the properties of 
the storage lipid molecules of the plant in which the 
enzyme precursor is expressed. For instance, by over- 
expressing the enzyme precursor it might be possible to 
bring about premature termination of the hydrocarbon chain 
elongation during fatty acid biosynthesis, thus resulting 
in lipids with shorter hydrocarbon chain fatty acid 
components. Such "medium chain" lipids are useful as 
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"hardening stock" in the production of margarines and 
detergents. Alternatively, by use of an anti-sense 
construct, it might be possible to reduce the levels of 
active enzyme in the plant, resulting in the increased 
production of longer chain fatty acids. Equally, because 
the enzyme catalyses the last stage in fatty acid 
biosynthesis, it should prove possible to modify the fatty 
acid levels of the plant. 

Typically the plants to be altered could include rape, 

sunflower, saf flower, soybean, peanut, cotton, oil palm or 
corn. 

Thus in another aspect the invention provides a method of 
altering the characteristics of a plant, comprising 
introducing into the plant the sequence of the invention 

or a functional equivalent thereof, so as to alter the 
level of acyl-ACP thioesterase activity. 

■ 

Preferably the enzyme precursor or function equivalent is 
selectively expressed in the plant seeds. This can be 
achieved by expressing the precursor or functional 
equivalent under the control of a seed specific promoter. 
Several such promoters are known (e.g. those for the ACP, 
napin and cruciferin genes). 

The characteristics one might expect to alter, in addition 
to acyl-ACP thioesterase activity are those influenced by 
levels of the enzyme precursor or mature enzyme, such as 
fatty acid yield and composition. 

Preferably the characteristic altered is the storage oil 
yield or composition of the seeds of the plant. 
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The acyl-ACP thioesterase from a. napus exhibits a strong 
substrate specificity, with oleoyl-ACP (18:1-ACP) being 
the preferred substrate. Ideally, the product hydrocarbon 
should have chain lengths shorter than 18 carbon atoms. 
One plant which contains storage lipids with fatty acid 
components of the preferred length is Cuphea. Thus, by 
use of DNA probes based on knowledge of the B. napus 
enzyme sequence, and by use of antibodies raised against 
the purified B. napus enzyme, it might be possible to 
identify clones containing the sequence encoding the 
corresponding Cuphea enzyme, which could be used to modify 
fatty acid biosynthesis in the desired manner. 

* 

The invention is further described by reference to the 
following examples and drawings of which: 

Figure 1 shows the DNA sequence of cDNA clone pNL2 which 
encodes the B. napus acyl-ACP thioesterase gene and the 
deduced amino acid sequence of the enzyme, 

Figure 2 shows the nucleotide sequence of cDNA clone pNL3 
which encodes an allelic variant of B. napus- acvl-ACP 
thioesterase, and the deduced amino acid sequence, 

Figure 3 shows the variations in nucleotide sequence 
between pNL2 and pNL3, 

Figure 4 shows the variations in the deduced amino acid 

sequence of the proteins encoded by pNL2 and pNL3 

Figure 5 shows the amino acid sequences obtained by direct 
sequencing of peptides generated by enzymatic digestion of 
B. napus acyl-ACP thioesterase, with boxes indicating 
residues which show variation from the amino acid sequence 
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expected from the cDNA sequences, and 

Figure 6 shows a 3. napus genomic Southern blot probed 

with an oilseed rape thioesterase probe. 

Example 1 

Figures 1 and 2 show the nucleotide sequence of pNL2 and 
pNL3 respectively, full length cDNA clones of the B. 
"apus acyl-ACP thioesterase gene r together with the 
deduced amino acid sequences of the respective proteins 
encoded. Several differences are observed between the 
amino acid sequence obtained directly from the protein and 
that deduced from the cDNA clones (see Figure 5). 

Clones pNL2 and pNL3 were obtained in the following 
manner. 

Poly A RNA was isolated from 25-30 days after flowering 
(D.A.F.) B. napus embryos, as previously described [Hall 
et al. (1978), Proceedings of the National Academy of 
Sciences 75.r 3196-3200], and used to construct a lambda 
gt10 cDNA. The cDNA was made essentially according to the 
manufacturer's instructions (Pharmacia) and then ligated 
into the vector and packaged in vitro . Basically, double 
stranded oligo dT-primed cDNA was synthesised from poly A + 
RNA according to Gubler and Hoffman [Gene (1983) 25:203- 
269]; EcoRI linkers were ligated to the ends, and the 
resulting material cloned into a bacteriophage vector, 

* 

lambda gt10 (Amersham International). 

Approximately 50 , 000 clones were screened using an 
oligonucleotide probe (RAAT 3AS - see below, Seq. ID No. 
5), derived from the rape thioesterase peptide VMMNQDT. 
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Several positive clones were obtained and the longest 
(12Q0bp) was subcloned into pTZISR (U.S.B.) making a 
construct termed pNLI . Following sub-cloning into MI3, 
overlapping fragments of pNLI were sequenced in both 
directions. 

A 

ATA C 
RAAT 3 AS 5" GTGCCCTGGTTCAATCATGAC 3' 21 mer 

T 

Translation of the DNA sequence gave rise to an open 
reading frame (ORF) of 233 amino acids plus a 325bp 3 f 
non-coding sequence. The ORF contained a number of 
sequences that had been identified by direct amino acid 
sequencing of rape acyl-ACP thioesterase peptides. pNL1 
was therefore determined to code for rape acyl-ACP 
thioesterase. 

A second B. napus cDNA embryo library was constructed in 
lambda ZAPII (Stratagene ) , according to the manufacturer's 
protocol. 100,000 plaques were screened with a 7 30bp 
Bglll pNL1 fragment and eight positive clones were 
obtained. The eight clones were purified to homogeneity 
and in vivo excised, according to the Sratagene protocol, 
to give rescued plasmids containing the cDNA insert within 
the polylinker. Restriction analysis revealed that two 
plasmids, pNL2 and pNL3, contained inserts of 1560bp and 
1460bp respectively. pNL2 and pNL3 were subcloned into 
Ml 3 and sequenced. Translation of the resulting DNA 
sequences showed both pNL2 and pNL3 to contain ORFs of 363 
amino acids, but differing in the length of their 3 1 non- 
coding sequences. 
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Neither pNL2 nor pNL3 contained an initiating methionine 
and, in order to obtain this, the technique of 5' RACE 
(Rapid amplification of cDNA ends) was employed, 
essentially as described by Frohman et ah [Proceedings of 
the National Academy of Sciences 75 , 8998-9002]. 1ug of 
25-30 D.A.F. B. napus embryo poly A + RNA was reverse 
transcribed at 37°C for 60 min with 200 units M-MuLV 
Superscript reverse transcriptase (BRL) and 500 pmoles 
random hexamers as primer. Excess primers were removed on 
a. Centricon 100 column and cDNA precipitated with 
isopropanol. After resuspension in 10ul TE buffer, the 
cDNA was tailed using 10 units terminal transferase (BRL) 
in buffer (BRL) and 200uM dATP at 37°C for 10 min. After 
treatment at 65°C for 5 min the mixture was diluted to 500 
ul with TE, 10ul mixture was used in a 100ul second 
strand Polymerase Chain Reaction (PCR) with 50 pmol rape 
thioesterase specific primer PTB5 

[5' GATTAGCAATTGTCTCGACGG, Seq. ID No, 6], 
50- pmol R primer 

[5' AAGGATCCGTCGACATC Seq. ID No. 7] 

and 5pmol T 17 -Ro"" R I P rimer 

[5' AAGGATCCGTCGACATCGATAATACGACT 

CACTATAGGGATTTTTTTTTTTTTTTTTT, Seq. ID No. 8]. 

The mixture was heated to 95°C for 7 min then held at 

72 °C. lOul 10 x PCR buffer (BRL), 200uM dNTPs and 2.5 

units Taq polymerase were added, heated at 45°C for 2 min 

then 72°C for 40 min. To PCR amplify the products, the 

mixture was heated to 94°C for 45 min to denature cDNA, 

primer annealed at 50° for 25 min and extended at 72°C for 

3 min. This cycle was repeated 34 times, followed by 

final extension at 72°C for 10 mins. 

lul of the above PCR products was used in a second 
amplification containing 200uM dNTPs, 2.5 units Taq, 50 
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pmole R-j 

;5' GACATCGATAATACGAC / Seq. ID No. 9], 

50 pmole rape thioesterase specific primer PTB2 

[5 f CAAGAATTAACAGGGCTGATG , Seq. ID No. 10], 

(internal to PTB5), PCR buffer added to 50ul. Mixture * 
heated 94° for 45 min, 50°C for 30 min and 72°C for 1 min 
- 28 cycles followed by final extension at 72° for 10 rain. 
The products of this second PCR amplification, after size 
selection on agarose gels, were ligated into pT7 Blue (AMS 
Biotechnology). Recombinants were identified by PCR using 
50 pmol R 1 primer and 50 pmol PTB3, another rape 
thioesterase specific primer [5 r GCGATTAACCGGGATGAAAAG , 
Seq, ID No. 1 1 ] 

internal to PTB2. This mixture was heated to 94°C for 1 
min, 55 °C for 1 min, 72 °C 1.1/2 min. This cycle was 
repeated 24 times and the products subjected to a final 
extension at 72° for 10 min. 

144 colonies were screened and 15, containing the largest 
inserts, were selected for DNA sequence analysis. The 2 
longest inserts, RACE 6 and RACE 1, were both found to be 
180bp longer than the 2 cDNA clones pNL2 and pNL3 at the 
5* end and to contain a putative initiating methionine. A 
160bp overlapping region between the cDNAs and the RACE 
products enabled the origin of the 2 RACE products to be 
determined. Thus RACE 6 was identical in sequence to pNL2 
and RACE 1 was identical to pNL3. 

The complete nucleotide and deduced amino acid sequences 
of pNL2 and pNL3 are shown in Figure 1 and Figure 2 
respectively. The 2 composite clones pNL2 and pNL3 
represent entire copies of the structural genes of rape 
acyl-ACP thioesterase. Excluding poly (A) tails, pNL2 
contains 1642bp and pML3 1523bp. Both pNL2 and pNL3 have 
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putative initiation methionines at nucleotide 169, which 
show good homology to the plant consensus translation 
initiation sequence, and to encode open reading frames 
(ORFs) of 366 amino acids. The two clones differ in the 
length of their 3' non-coding sequences. The open reading 
frames of pNL2 and pNL3 encode polypeptides of 41,960 Da 
and 41 r 9 83 Da respectively and these represent the 
precursor form of the thioesterase. On SDS-PAGE, mature 
rape thioesterase migrates with a molecular weight of 38 
kDa (see below) suggesting a transit peptide of about 4 
kDa. Since the N-terminus of the mature protein is 
blocked to Edman degradation the precise cleavage site of 
the precursor polypeptide could not be determined. 

Figure 3 shows the nucleotide sequence variation between 
cDNA clones pNL2 and pNL3 . Some of these sequence 
differences are observed to be in the ORFs, and result in 
different deduced amino acid sequences for the 
polypeptides encoded by pNL2 and pNL3. These differences 

* 

are illustrated in Figure 4. 

To further characterise the protein, it was highly 
purified, as described below. (All steps were carried out 
at 4°C unless otherwise stated.) 

Step 1; Preparation of cell free extract 

Rape seed OOOg), which had been harvested between 40-60 
days post anthesis (after flowering), and stored at -70°C, 
was homogenised in 250ml 20raM potassium phosphate, pH 7.3, 
containing 2mM dithiothreitol and 0.2mM 

phenylmethylsulf onyl fluoride using a polytron at maximum 
speed. The resulting homogenate was stirred for 30 
minutes with Triton X-100 (stock solution 20% v/v) at a 
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final concentration of 1% v/v. After centr i f uaation at 
20,000g for 30 minutes the supernatant was filtered 
through glass wool to remove the floating lipid layer. 

Step 2: Ammonium Sulphate fractionation 

The filtrate was brought to 30% saturation by the addition 
of solid ammonium sulphate and the suspension was stirred 
for 30 minutes prior to centr if ugation at 20,000g for 30 
minutes. The supernatant was refiltered through glass 
wool, and brought to 50% saturation with solid ammonium 
sulphate. After stirring for 45 minutes the extract was 
centrifuged for 20 minutes at 30,000g. The pellets were 
stored at -20°C until required. 

Ammonium sulphate pellets were resuspended in Buffer A 
(20mM bis-Tris HC1, pB 6.5, containing 2mM dithiothreitol 
and 0.2mM PMSF) and centrifuged at 30 OOOg for 10 minutes 
to remove particulate matter. 

Step 3: Fast Flow Q-Sepharose chromatography 

The supernatant was diluted with Buffer A until the 
conductivity was less than 20 x 10~ 4 ohms (final volume 
200ml) and loaded onto a Fast Flow Q-Sepharose column 
(2.5cm x 12cm) equilibrated in 20mM bis-Tris HC1, pH 6.5. 
After loading, the column was washed with equilibration 
buffer until absorbance at 280nm was less than 0.15. 
Enzyme activity was eluted with a 300ml gradient from 0- 
0.5M NaCl in Buffer A. Fractions (5ml) were collected and 
assayed for activity. The flow rate throughout was 1.3ml 
minute"^ . The unbound sample was assayed for activity and 
typically less than 10% of the activity loaded did not 
bind. Thioesterase activity eluted between 150 and 300mM 
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NaCl. Fractions containing activity were pooled (115- 
120ml) and stored overnight at -70°C. 

Step 4; ACP-Sepharose chromatography 

The Q-Sepharose pooled fractions were rapidly thawed and 
applied to an ACP-Sepharose column (1cm x 13cm) 
equilibrated in 20mM potassium phosphate pH 6.5. After 
loading, the column was washed with 0.1M potassium 
phosphate, pH 6.5, alone. Activity was eluted with 0.3M 
potassium phosphate pH 8.0. All buffers contained 2mM DTT 
and 0.2mM PMSF and the flow rate throughout was 0,5ml 
minute \ Fractions (5ml) were collected and assayed for 
activity. Using freshly synthesised affinity matrix, 
greater than 95% of the activity loaded was bound and no 
activity was detected in the wash fractions. Typically, 
30-40% of the activity loaded was eluted. Fractions 
containing activity were pooled (25-30ml) and stored at 
-20°C overnight. Typically 7 0% of activity was retained 
after thawing. 

Step 5: F.P.L.C Mono P chromatography 

The ACP-Sepharose pool was dialysed against 2.51 25mM 
bis-Tris, iminodiacetic (pH 7.1) acid for 4.5 hours with 
one change of buffer after two hours. The conductivity of 
the dialysed sample was checked and, if necessary, the 
sample was diluted with 25mM bis Tris pH 7.1 until the 
conductivity was less than 10 x 10~ 4 ohms. The sample was 
loaded via a 50ml superloop onto a Mono P column (HR 5/20) 
equilibrated in 25mM bis-Tris iminodiacetic acid pH 7.1 at 
a flow rate of 1 ml minute" at room temperature. The 
column was washed with equilibration buffer and activity 
was eluted with 50ml 10% polybuffer 74 ( v/v )/ iminodiacetic 
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acid, pH 4.0, at a flow rate of 1 ml minute -1 . Fractions 
(1ml) were collected into tubes which contained 50ul 1M 
bis-Tris pH 9.9 to immediately neutralise the elutant 
since thioesterase activity appeared to be unstable at low 
pH. To determine the pH elution gradient, the column was 
calibrated with a blank run prior to loading the enzyme. 
Under these chromatographic conditions, all activity bound 
to the column and typically 40-50% of the activity loaded 
was recovered in five fractions. 

Active fractions eluted from the Mono P column were 
immediately concentrated on an Amicon 10 

microconcentrator , pre-coated with protein to prevent non- 
specific binding. Concentrated sample (final volume 1ml) 
was brought to 0.2M NaCl and stored at -20°C overnight. 
Samples stored at -20°C lost almost no activity while 
those stored at'4°C lost typically 60% activity. 

Step 6; F.P.L.C. Superose 12 gel filtration 

On thawing, the Mono P concentrated pool was centrif uged 
on an MSE centaur for 5 minutes prior to loading onto two 
Superose 12 gel filtration columns HR10/30 (1 x 30cm) in 
series. To obtain maximum purification, care was taken 
not to overload the column and therefore 200ul aliquots of 
the supernatant were loaded for each run. The column 
running buffer was 50mM sodium phosphate, pH 7.2, 
containing 150mM NaCl. Fractions (200ul) were collected 
and assayed for activity. Activity eluted as one peak 
over 5-6 fractions and typically 45% of the activity 
loaded was recovered. Five runs were necessary to purify 
the concentrated sample. 

Once partially purified, the enzyme was further 
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characterised. The intact enzyme was found to have a 
- molecular weight of about 70 kDa, being composed of a 
dimer of approximately 38kDa sub-units. A degraded form 
of the enzyme (33kDa) was found to co-purify with the 
intact 38kDa subunits. The pH optimum was 9.5. As with 
other plant acyl-ACP thioesterases so far described, the 
preferred substrate of the B. napus enzyme was found to be 
oleoyl-ACP (18:1-ACP). 

The partial amino acid sequence of the purified protein 
was determined as described below. 

The highly purified preparation, after Mono P 
chromatography, was concentrated and reductively alkylated 
with 4-vinyl-pyridine. Protein samples were loaded 
directly, via a 100ul loop, onto a Brownlee C-8 (CO3-032) 
column pre-equilibrated in 0.1% TFA (v/v). Flow rate was 
O.Tral min . The column was washed with 0.1% TFA until 
absorbance at 214nm was close to baseline. Protein was 
eluted with a gradient of increasing Buffer B (90% 
acetonitrile; 0.0 85% TFA) 0-70% over 3 5 min. It was 
necessary to pause the gradient for 40-45 min at 38% 
Buffer B to elute isocratically a major absorbance peak 
which contained no protein and was probably a contaminant 
from 4-vinyl-pyridine. Protein peaks were collected 
manually into Eppendorf tubes, fractions containing acyl- 
ACP thioesterase polypeptides (38 and 33kDa) were pooled. 

After reversed-phase chromatography, the pooled purified 
enzyme preparation was brought to near dryness on a Univap 
concentrator. The appropriate digestion buffer, detailed 
below, was added and the pH was measured using indicator 
paper and adjusted with 1M NaOH. During the concentration 
step, a heavy precipitate formed which disappeared on 
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neutralisation. The final volume of samples prepared for 
digestion was 100-150ul. 

Preparations containing 3-7ug (80-180 pmoles) of purified 
thioesterase polypeptides (38 and 33kDa) were digested 
with 0.15ug Endoproteinase Lys-C (Boehringer Mannheim). 
To check for auto-digestion a control sample was prepared 
with 0.15ug protease in 150ul tOOmM Tris/HCl (pH 8.5) r 2mM 
EDTA (digestion buffer). Incubations were left at 37°C 
for 10-16 hours. Additional Endoproteinase Lys-C (0.15ug) 
was added to the thioesterase and control incubations. 
Digestion was allowed to proceed at 37°C for 40 hours. 
Samples were stored at -20 °C prior to reversed-phase 
chr oma tog r aphy . 

Similarly a preparation, containing 9ug (240 pmoles) of 
purified enzyme (38 and 33kDa polypeptides), was digested 
with 0.18ug TPCK-typsin (Cooper Biomedical) overnight at 
37°C. A control sample was prepared with 0.1 8ug TPCK- 
trypsin in 100ul 0.2M ammonium carbonate/1raM calcium 
chloride (digestion buffer) and incubated as for the 
protein sample. Fresh TPCK-trypsin was prepared and a 
further 0.1 8ug was added to the protein and control 
incubations. Both samples were left at 37°C for 8 hours. 
Samples were stored at -20°C prior to reversed-phase 
chromatography. 

Reversed-phase chromatography was performed at room 
temperature on Gilson HPLC equipment adapted for microbore 
column chromatography. Samples were loaded, via a 100ul 
loop, onto an tyjuapore RP 300-C8 microbore column (250 x 
1mm id; 7u) pre-equilibrated in 0.1% TFA. Flow rate was 
0 . 1ml/minute. Peptides were eluted with a gradient of 
increasing buffer B (90% acetonitr ile ; 0.08 5% TFA) 0-70% 
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over 100 minutes. Absorbance was monitored at 214nm and 
peaks were collected manually into Eppendorf tubes. Prior 
to loading sample, acetonitrile gradients were performed 
until a reproducible low baseline was obtained. 

N-terminal Protein Sequencing was performed on an Applied 
Biosystems model 475 protein sequencer as described 
previously (Cottingham et al., [1988] Biochim. Biophys. 
Acta 954 , 201-207). A number of peptide sequences were 
obtained. These are shown in Figure 5 and are Seq. ID Nos 
12 (peptide 26), 13 (peptide 15), 14 (peptide 30), 15 
(peptide 34) and 16 (peptide 39). The residues boxed in 
the tryptic peptides are different to those expected from 
translation of the cDNA sequences. These differences are 
thought to be due to the allelic nature of the gene. 

From the partially determined amino acid sequence, it was 
possible to identify nucleic acid sequences encoding the 
enzyme. The techniques and methods of accomplishing this 
are well known to those skilled in the art. Figure 5 
shows a genomic Southern blot probed with thioesterase - 
specific probe. Molecular standard markers (obtained by a 
PstI digest of lambda DNA) are shown on the right hand 
side with their size shown in kilobases. The middle four 
lanes represent controls having a known copy number (1, 2, 
5 or 10) whilst Hpa, Sac and Bam are restriction digests 
of rape genomic DNA resulting in 12, 17 and 16 copies 
respectively per haploid genome. The probe used was a 
764bp fragment generated by PCR, using primers PTB1 AS and 
PTB2AS (Seq. ID Nos 17 and 18 respectively), shown below: 

?TB 1 AS 5' CCGAGCTCGTCGATTGATGGAAGA 3' 24-mer 201-224bp 

PTB2AS 5' TTGGTACCGGTAAGCTTTGAGATC 3' 24-mer 942-965bp 
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Example 2 

This example describes the construction of a plant 
transformation vector containing an antisense rape * 
thioesterase fragment linked to a seed-specific acyl 
carrier protein (ACP) promoter for use in altering fatty 
acid composition and oil yield in transgenic oil seeds. 

The 764 bp fragment of pNL2 (described above) was produced 
by PCR amplification using the primers PTB AS1 • PTB AS1 
and PTB AS2 incorporated SacI and Kpn1 restriction sites 
respectively into the fragment. A 100ul PCR reaction 
contained 10ul 10x reaction buffer (Stratagene) , 200uM 
dNTPs, 5.0 pmole PTB AS1 , 50 pmole PTB AS2 f 100pg pNL2, 2.5 
units Taq (Stratagene). Reaction conditions were 92° for 
45 sec, 55°C for 45 sec, 72°C for 90 sec for 25 cycles 
followed by 72 °C for 10 min. The 764 bp fragment was 
recovered via DEAE paper from agarose gels, digested with 
SacI and Kpn I, re-run on agarose gels and recovered from 
DEAE paper. 

A 1.4kb BamHI-Bglll fragment of pTZSBS [de Silva et al. , 
(1992) Plant Molecular Biol. 18:1163-1172], representing a 
seed specific ACP promoter sequence, was ligated into 
BamHI-restricted pTZ!8R [Mead et al. f (1986) Prot. Engin. 
1: 64-74] to produce pTZAP! . pTZAP! was restriction 
analysed to confirm correct orientation of ACP promoter 
fragment. pTZAP! was restricted with SacI and Kpn1 and 
recovered from agarose gel via DEAE paper. The recovered 
pTZAP! was ligated with the 764 bp NL2 fragment from above * 
to form pNL4. pNL2 and the plant transformation vector 
pBI 101 (Clontech) were digested with BamH1 and Sad. The ' 
2.15kb fragment of pNL2 and the pBI 101 vector fragment 
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were recovered from DEAE cellulose and ligated to form 
pETRT. pETRI was subjected to restriction analysis to 
confirm correct insertion of gene fragments. 

Example 3 

Agrobacterium-mediated Transformation of Brassica napus 

The vector pETRI was transferred into Agrobacter ium 
tumefaciens pGV3850 (Zambryski et al., (1983) EMBO J. 
2:2143-2150) using a direct DNA uptake procedure [An et 
al., (1988) In Plant Molecular Biology Manual (Gelvin & 
Schilperoort, Eds.), Kluwer Academic Publishers, pp. 1- 
19]. From the resultant Agrobacterium colonies, DNA was 
extracted, transformed into E. coli from which it was re- 
isolated to confirm correct gene insertion. 

Brassica napus (cv. Westar) stem segments were transformed 
with the above binary Agrobacterium strain using the 
procedure of Fry et al. [(1987) Plant Cell Reports £: 321- 
325] with the following modifications: kanamycin 
selection was at 20ug/ml and was delayed until 2 weeks 
after infection; carbenicillin was replaced by cetotaxime 
(500ug/ml); arginine was omitted from the regeneration 
media; 0.8% agar was replaced by 1% agarose; a 2-3 day 
pre-treatraent of stem segments was carried out prior to 
Agrobacterium infection and a N. plumbag inif olia cell line 
[Barfield et al. ( 1985), Plant Cell Reports, 4^: 104-107] 
was used as feeder layer. Shoots formed and remaining 
green on Plant Cell Reports, 4j 104-107] was used as 
feeder layer. Shoots formed and remaining green on 
selective media after 2 transfers were tested for the 
presence of nopaline [Otten and Schilperoort (1978), 
Biochem. Biophys. Acta, 527: 497-500]. Positive shoots 
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were transferred to soil, potted on into 5" diameter pots 
and transferred to growth rooms operating a ISh day (22°C) 
and 8h night <18°C) cycle. 

Control plants were transformed with A. tumefaciens 
pGV3850 containing pAPIGOS [de Silva et al. (1992), Plant 
Molecular Biology, J8^: 1163-1T72] f a plasmid in which the 
ACP promoter controls the expression of the beta- 
glucuronidase (GOS) reporter gene. 

Example 4 

Analysis of Transformed Plants 

DNA was extracted from pETR1 antisense thioesterase plants 
and from APIGOS transformed control plants and Southern 
blotted to confirm the presence of inserted genes. 

Mature seeds from 39 pETR1 transformed plants and 10 
APIGOS transformed plants were analysed for oil content 
and for fatty acid composition. 

After weighing, seeds were macerated in methanol, using an 
Ultraturrax homogeniser, and extracted overnight in 
chlorof orm/methanol (2:1 v/v). After removing seeds by 
filtration, samples were partitioned with 1% sodium 
chloride solution. The chloroform-soluble lipid fraction 
was removed and evaporated to dryness. To analyse for 
fatty acid composition, extracted lipids from above were 
refluxed for 60 rain with acid methanolysis reagent 
(toluene/methanol/conc. sulphuric acid 10:20:1). To 
determine the total fatty acid content of the samples 
heptadecanoic acid (C17:0) was added as an internal 
standard (to about 20% of total lipid weight). After 
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addition of water, the resultant fatty acid methyl esters 
were extracted with hexane and analysed by gas 
chromatography on a Carlo Erba chromatogr aph equipped with 
a Supelcowax 1 0/f used silica column* 

Table 1 shows the percentage total fatty acid (% TFA) 
content and the fatty acid composition of seeds from the 
transgenic plants. The fat content of the control APIGUS 
seeds (shown as AP1-TAK in Table 1) is seen to range from 
38.1%-27.3%. The fat content of the pETR1 antisense 
thioesterase seeds ranged from 43.2%- 14. 7%. The reduced 
oil content of the pETRl seeds could result from reduced 
levels of thioesterase activity. Acyl-ACP thioesterase 
serves as the chain terminating enzyme of de novo fatty 
acid biosynthesis, cleaving the synthesized acyl-ACP to 
free fatty acid and ACP. As such, decreasing the level of 
this terminating enzyme, via antisense technology, could 
have limited the overall rate of oil synthesis in the 
seed. 

■ 

In respect of fatty acid composition, 2 pETRl plants, 19 
and 9, have altered fatty acid compositions of potential 
commercial use. Seeds from these plants both contain 
dramatically reduced levels of linolenic acid (18:3) and 
also significantly reduced levels of linoleic acid (18:2). 
The production of rape oils containing reduced 
polyunsaturated fatty acids, such as those from plants 19 
and 9, is highly desirable for use as cooking oils since 
they are less susceptible to oxidation. 

Example 5 



H 



This example describes the construction of plant 
transformation vectors containing sense rape thioesterase 
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genes linked to the seed specific ACP promoter for use in 
altering fatty acid yield and composition in transgenic 

oil seeds. 

To modulate fatty acid biosynthesis within cells of seed 
tissue by "overexpressing" rape thioesterase, it is 
necessary to target the thioesterase enzyme to the 
intracellular site of fatty acid synthesis, namely the 
plastid. Prior to obtaining a full length thioesterase 
cDNA, which contained a plastid targetting sequence, a 
chimaeric thioesterase gene was constructed in which a 
rape ACP plastid transit sequence was fused to a partial 
thioesterase cDNA which encoded for the mature enzyme. 
The chimaeric thioesterase gene was linked to the seed 
specific rape ACP promoter. 

A 361 bp fragment of pNL2, encoding the N-terminal region 
of the rape thioesterase, was produced from pNL2 by PCR 
using the primers PTB 26 

(5' CTTCTAGATCTCAATGTGCATT, Seq. ID No 19) and PTB 33 
(5' CCGAATTCTGC AGTCTCGTCTTCTC , Seq. ID No 20). 
PCR reaction (100ul) contained 10ul 1 0x reaction buffer 
(Stratagene) , 200uM dNTPs, 50 pmole PTB 26, 50 pmole PTB 
33, 100ng pNL2, 2.5 units Taq (Stratagene). Mixture 
heated 92°C for 30 sec, 55°C for 30 sec, 72°C for 60 sec 
for 25 cycles, followed by 72 P C for 10 min for final 
extension. The PTB 26 primer introduced a Xbal 
restriction site into the fragment and the PTB 33 primer 
introduced EcoRI and Pstl sites. Following isolation from 
agarose gels and DEAE paper the PCR fragment was 
restricted with EcoRI and Xbal, ligated into M13 and DNA 
sequenced to ensure correct sequence. An EcoRI - Bglll 

version of the PCR fragment was ligated into pNL6 to yield 
pNL1 1 . pNL6 was constructed by restricting pNL2 with 
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EcoRI and Bglll (partial) recovering the 1.2kb fragment, 
via DEAE paper; and ligating into EcoRI/BamH 1 restricted 
pTZ 1 9 . 

A 166 bp ACP transit sequence was obtained from rape ACP 

cDNA clone 29C08 [Safford et al . (1988), Eur. J, Biochem. 

174 ; 287-29 5] by PCR amplification using the primers PTB 
31 

(5' AATGTCGACCACTTTCTGCTC , Seq. ID No 21) and PTB 32 
(5 f TCGAGCTCTGCAGCGCAGGAGAC, Seq. ID No 22). 
PTB 31 introduced a Sail site into the fragment and PTB 32 
introduced Sad and Pst1 sites. The PCR product was 
recovered from agarose gel, via DEAE paper, restricted 
with Sail and Sad and cloned into M13 to confirm correct 
DNA sequence. The SAII-Sacl fragment was ligated into 
Sall/SacI restricted pBluescript SK + (Stratagene) to form 
pNL12. 

To link the ACP transit sequence to the mature 
thioesterase sequence pNL1 1 was restricted with Pst1 and a 
1.1kb fragment recovered and ligated into Pst1 restricted, 
phosphatased pNL12. This produced pNL13. 

The 1.4kb ACP promoter (API) was released from pTZ5BS by 
restricting with BamH1 and Sail and was ligated into 
3araH1/Sal1 restricted pUCBM20 (Boehr i r^er-Mannheim ) to 
yield pNL14. 

To link the ACP promoter to the chimaeric ACP-mature 
thioesterase sequence pNL13 and pNL14 were both restricted 
with Sail. A 1.2kb fragment was recovered from pNL1 3 and 
ligated into the linearised, phosphatased pNL1 4 vector to 
yield pNL15. Following restriction analysis to confirm 
the correct gene orientation, pNL15 was restricted with 
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BamH1 and Sad to release a 2.3kb fragment containing the 
ACP promoter, the AC? transit sequence and the mature 
thioesterase sequence. This fragment was ligated into the 
plant transformation vector pBI101 which had been 
restricted with BamH1 and Sad to remove the b- 
glucuronidase (GUS) fragment. This final plasmid, pARTEl , 
was propagated in E.coli and transferred into 

A. tumef aciens pGV3850 via the previously described direct 
DNA uptake method. 

The resulting A. tumefaciens strain was used to transform 

B . napus stem segments as described in Example 3 above. 
From the resulting transgenic plants, seeds will be 
analysed for altered thioesterase levels and for altered 
oil content and fatty acid composition. 

Example 6 

Upon obtaining a full length rape acyl-ACP thioesterase 
cDNA, a plant transformation vector was constructed 
containing this cDNA linked to the ACP promoter-sequence. 

Following EcoRI and partial Xbal restriction of pNL2 1.2kb 
fragment was recovered onto DEAE paper and ligated into 
pBluescript SK + to yield pNL16. A 18 0bp fragment 
containing 5' non-coding thioesterase sequence plus 3 
amino acids of the transit peptide was obtained by PCR 
amplification of pNL5 , a rape thioesterase genomic clone r 
using primers PTB 34 

(5* TTG TCG ACA CAC ATG CGT CAT, Seq. ID No 23) and PTB 
35 

(5 f GCA AGC TTC AAC ATT TTT GAT, Seq. ID No 24). 
Primer PTB 34 added an additional 6 bp of thioesterase 
sequence onto the RACE 6 product previously described and 



WO 93/18158 



PCT/GB93/00432 



- 27 - 

also a Sail restriction site. PTB 35 introduced a Hindlll 
site into the fragment. The 180bp PCR fragment was 
recovered from DEAE paper restricted with Sail and Hindlll 
and ligated into Sall/Hindlll restricted pBluescript SK + 
to give pNL17. DNA sequence analysis was carried out to 
confirm correct sequence. 

A 440bp fragment containing the transit peptide plus part 

of the mature thioesterase coding sequence was obtained by 

PCR amplification of pNL2 using primers PTB 36 

(5 1 ACA AGC TTT CGT GTA ATG TGA, Seq. ID No 25) and PTB 

37 

(5 1 GTG AAT TCA GAT CTC AAT GTG CAT T f Seq. ID No 26). 
The PTB 36 primer introduced a Hindlll site 3 1 to the 
thioesterase initiating ATG and PTB 37 introduced an EcoRI 
site 3* to the Bgl II site. The 440bp PCR fragment was 
recoved via DEAE paper, restricted with Hindlll and EcoRI 
and ligated into Hindlll/EcoRl restricted M13 to yield 
pM1 3HENL2 . Fragment was sequenced to confirm correct 
sequence. 

pNL17 was restricted with Hindlll/EcoRI and ligated to the 
440bp Hindlll/EcoRI fragment from pM 1 3HENL2 to yield 
pNL18. A 560bp Sal1-Bg1II fragment was recovered from 
pNL18 and ligated into pNL16 to form pNL19. pNLl9 
represents a 'full length 1 thioesterase cDNA. 

To link the thioesterase cDNA to the ACP promoter a 1.4kb 
Sal1-Sac1 fragment was released from pNL19 and ligated 
into Sal1-Sac1 restricted pNL14 to yield pNL20. pNL20 was 
restricted with BamH1 and Sad to yield a 2.8kb fragment 
containing the ACP promoter linked to the thioesterase 
cDNA. This 2.8kb fragment was ligated into the plant 
transformation vector pBI101, in which the BamHl-Sac1 
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fragment encoding the h-olucuronidase (GUS) gene had been 
removed. The final vector, pRATE, was propagated in 
E.coli and transferred into A. tumefaciens pGV3850 as 
previously described . 

The resulting A. tumefaciens strain was used to transform 
B. napus stem segments as described in Example 3. From 
the resulting transgenic plants, seeds will be analysed 
for altered thioesterase levels and for altered oil 
content and fatty acid composition. 
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CLAIMS 

1. A nucleotide sequence encoding an enzyme precursor 
having acyl-ACP thioesterase activity, comprising 
nucleotides 16 9-1269 of the sequence shown in Figure 1 
(Seq. ID No 1) or functional equivalents thereof • 

2. A sequence according to claim 1, comprising 
nucleotides 169-1269 of the sequence shown in Figure 2 
(Seq. ID No 3). 

3. A sequence according to claim 1 or 2, further 
comprising a 5* untranslated region, 

4. A sequence according to claim 1, 2 or 3, further 
comprising a 3" untranslated region. 

5. A sequence according to any one of the preceding 
claims, comprising the sequence obtainable from Brassica 
napus . 

6. A sequence according to any one of the preceding 
claims f further comprising a seed-specific promoter. 

7. A polypeptide having acyl-ACP thioesterase activity, 
comprising the amino acid sequence shown in Figure 1 (Seq. 
ID No 1) or functional equivalents thereof. 

8. A polypeptide according to. claim 7, comprising the 
amino acid sequence shown in Figure 2 (Seq. ID No 3). 

9. A vector comprising a nucleotide sequence according 
to any one of claims 1-6. 
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10. A host cell into which has been introduced the 
sequence of any one of claims 1-6. 

11. A plant host cell according to claim 10, which 
comprises part of, or is capable of giving rise to, a 
plant. 

12. A method of altering the characteristics of a plant, 
comprising introducing into the plant the sequence of any 
one of claims 1-6, so as to alter the level of acyl-ACP 
thioesterase act ivi ty . 

13. A method according to claim 12, wherein the 
characteristics so altered comprise the fatty acid yield 
and/or fatty acid composition of the plant. 

14. A method according to claim 12 or 13, wherein the 
yield and/or composition of seed storage oil of the plant 
is altered. 

15. A method of producing an enzyme having . acyl-ACP 
thioesterase activity, comprising: inserting the sequence 
of any one of claims 1-6 capable of expressing the enzyme 
into a suitable expression vector; transforming a host 
cell with said vector; growing said transformed host cell 
in suitable culture conditions? and obtaining the enzyme 
from the host cells and/or the culture medium. 

16. A method according to claim 15, wherein the host cell 
is a microorganism. 
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V630 v640 v650 v660 

CCA GCT TGG AGT GAT 6TT GTT GAG ATA GAG ACA TGG TGC CAG 

Pro Ala Trp Ser Asp Val Val Glu Ile Glu Thr Trp Cys Gln 

V670 v680 V690 v700 

AGT GAA GGA AGG ATT GGA ACG AGA CGT GAT TGG ATT CTA AGG 

Ser Glu Gly Arg Ile Gly Thr Arg Arg Asp Trp Ile Leu Arg 

V710 v720 v730 v740 

GAC TCT GCT ACA AAT GAA GTT ATT GGG CGT GCT ACA AGC AAG 

Asp Ser Ala Thr Asn Glu Val Ile Gly Arg Ala Thr Ser Lys 

V750 v760 v770 v780 

TGG GTG ATG ATG AAC CAA GAC ACA AGG CGG CTT CAA AGA GTT 

Trp Val Met Met Asn Gln Asp Thr Arg Arg Leu Gln Arg Val 

V790 V800 v810 v820 v830 



ACA GAT GAA GTT CGG GAC GAG 



AC TTG GTT TTC TGT CCT CGA 



Thr Asp Glu Val Arg Asp Glu Tyr leu Val Phe Cys Pro Arg 

V840 v850 v860 v870 

GAA CCC AGA CTA GCG TTT CCA GAA GAG AAC AAT AGC AGC TTA 

Glu Pro Arg Leu Ala Phe Pro Glu Glu Asn Asn Ser Ser Leu 

V880 v890 v900 v910 

AAG AAA ATC CCA AAA CTA GAA GAT CCA GCT CAG TAT TCT ATG 

Lys Lys Ile Pro Lys Leu Glu Asp Pro Ala Gln Tyr Ser Met 

V920 V930 v940 v950 

CTA GAG CTT AAG CCT CGG CGA GCT GAT CTG GAC ATG AAC CAG 

Leu Glu Leu Lys Pro Arg Arg Ala Asp Leu Asp Met Asn Gln 

v960 v970 v980 v990 

CAC GTG AAT AAC GTC ACC TAC ATT GGA TGG GTG CTT GAG AGC 
His Val Asn Asn Val Thr Tyr Ile Gly Trp Val Leu Glu Ser 

V1000 vlOlO V1020 V1030 V1040 

ATA CCT CAA GAA ATC ATT GAT ACG CAT GAG CTT CAA GTT ATA 
Ile Pro Gln Glu Ile Ile Asp Thr His Glu Leu Gln Val Ile 

V1050 V1060 V1070 V1080 

ACT CTA GAT TAC AGA AGA GAA TGC CAG CAA GAT GAC ATT GTA 
Thr Leu Asp Tyr Arg Arg Glu Cys Gln Gln Asp Asp Ile Val 

V1090 vllOO vlllO V1120 

GAT TCA CTC ACC ACC TCT GAA ATC CCT GAC GAC CCG ATC TCA 

Asp Ser Leu Thr Thr Ser Glu Ile Pro Asp Asp Pro Ile Ser 
yll30 vll40 vll50 vll60 

AAG CTT ACC GGG ACC AAC GGA TCT GCC ACG TCA AGC ATA CAA 

Lys Leu Thr Gly Thr Asn Gly Ser Ala Thr Ser Ser Ile Gln 

V1170 V1180 V1190 V1200 

GGA CAC AAT GAG AGC CAA TTC TTG CAT ATG CTG AGG TTG TCA 

Gly His Asn Glu Ser Gln Phe Leu His Met Leu Arg Leu Ser 
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V1210 V1220 V1230 V1240 V1250 

GAA AAT 66C CA6 GAG ATC AAT CGT 6GA AGA ACA CAA TGG AGA 
Glu Asn Gly Gln Glu Ile Asn Arg Gly Arc Thr Gln Trp Arg 

V1260 V1270 V1280 V1290 V1300 

AAG AAA TCC TCA CGA TGATTTTCTTATATGCTTCTCCAGTGTTGGTGAAG 

Lys Lys Ser Ser Arg ter 

V1310 V1 320 V1330 V1340 V1350 
ncnGCTTCACGTTTATAGTTTTAnGTGTCTTTTGATCAGATCTGTTGTGGGG 

V1360 V1370 V1380 V1390 V1400 vlftlO 
TTAGATGGTACTGGATTATTTGTGTAGTGTGGCGTCTGTGTTGTTCGTTTTCTGT 

V1420 V1430 V1440 V1450 V1460 

CCAMTTTTGGGCTTTAAGCAAAACGnTCnAGGTCTGCTTGTAGCTTTTCATA 

V1A70 V1480 V1490 V1500 V1510 V1520 
AACTA6ATTTGTTCATCCAATTATATATGCTGA6TTTCTCT6ATGAAACTGGGTA 

V1530 V1540 V1550 V1560 V1570 
T6T6AT6nG6AGGCAAMTnGCACAAnAAMTnGTAnACATAAGTTTTTT 

V1580 V1590 V1600 V1610 V1620 V1630 
GGATAAGCTAAAGnGTATGCMTGTAATACTTTTTTATTTATTGGAGATTAATA 

V1640 V1650 V1660 V1670 V1B80 

ATATTTTAAATAAAAAAAAAAAAAAAAAAAAAAAAMAAAAAAAAAAAAAAAAAA 

V1690 V1700 V1710 
AAAAAAAAAAAAAAAAAACTCGAG 
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VlO v20 v30 v40 

66C6TCATTGCCAATT66CGAAACAACCAGCTTTTTAAT6GCAATAT 

v50 v60 v70 v80 vQn vinn 

CGTATTTTACTAATCTCCACCTTCCTCGTTAACATCGAGCTTCAGATACAACACA 

vllO vl20 vl30 vian visn 

CAGATCCCTGATTCTTCTTCTATAAACCAAAACCTCAGGAACCATAAAAAAAAAA 

vl60 vl70 vl80 vl90 v?nn 

GGGCATCAAAA ATG TTG AAG CTT TCG TGT AAT GTG ACT AAC A AC 

Met Leu Lys Leu Ser Cys Asn Val Thr Asn Asn 

v210 v220 v?30 voiin 

TTA CAC ACC TTC TCC TTC TTC TCC GAT TCC TCC CTT TTC ATC 

Leu His Thr Phe Ser Phe Phe Ser Asp Ser Ser Leu Phe Ile 

v250 v260 v27Q vim 

CCG GTT AAT CGC CGT ACC ATC GCC GTC TCG TCT TCT CAG CTA 
Pro Val Asn Arg Arg Thr Ile Ala Val Ser Ser Ser Gln Leu 

v 290 v300 v310 v^9n 

Arc WS PpS SFI ? AT S CT F TA ? G6 GCA GTT ATC TCC GCG 

arg Lys Pro Ala Leu Asp Pro Leu Arg Ala Val Ile Ser Ala 

V330 v340 v350 v36f) 

GAT CAG GGA AGC ATC AGC CCT GTT AAT TCG TGT ACA CCG GCG 

Asp Gln Gly Ser Ile Ser Pro Val Asn Ser Cys Thr PrS Ala 

V370 v380 v390 vkQQ \/znn 

GAT CGG TTA CGA GCT GGT CGA TTG ATG GAA GAT GGT TAT TCG 
Asp Arg Leu Arg Ala Gly Arg Leu Met Glu Asp Gly Tyr Ser 

VA20 v^30 vZic;n 

TAC AAA GAG AAG TTC ATT GTT AGA AGC TAT GAG GTT G6GATT 

Tyr lys Glu Lys Phe Ile Val Arg Ser Tyr Glu Val Gly Ile 

V460 v470 v480 vUQC) 

AAC AAA ACC GCC ACC GTC GAG ACA ATT GCT AAT CTC TTA CAG 

Asn Lys Thr Ala Thr Val Glu Thr Ile Ala Asn Leu Leu Gln 

v500 v510 v520 v53n 

GAG GTG GCG TGT AAC CAT GTT CAG AAG TGT GGA TTC TCG ACG 

Glu Val Ala Cys Asn His Val Gln Lys Cys Gly Phe Ser Thr 

V540 v550 v560 v570 

GAT GGA TTT GCC ACA ACA CTC ACC ATG AGG AAA TTG CAT CTC 

Asp Gly Phe Ala Thr Thr Leu Thr Met Arg Lys Leu His Leu 

v580 v590 v600 v6io v6?n 

RS SIP $£I ?F A A6A £ T6 £ AC ATT 6AG ATC ^C AAA TAT 

Ile Trp Val Thr Ala Arg Met His Ile Glu Ile Tyr Lys Tyr 
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RAPE ACYI-AfP IHlfl£SI£RAS£ 



NL2 MLKLSCNVTN HLHTFSFFSDSSLF I PVNRRTLAVSSSQPRKPALDPLRAV 50 

NL3 N — 1 L 50 

NL2 ISADQ6S ISPVNSCTPADRFRAGRLMEDGYSYKEKFIVRSYEVGINKTAT 100 

NL3 — L 100 

NL2 VETI ANLLQ EVACNHVQKCGFSTDGFATTLTMRKLHL I WVTARMH IEI YK 150 

— • 150 

NL2 YPAWSDVVE I ETWCQSEGR I GTRRDW I LRDSATNEV I GRATSKWVMMNQD 200 

_ 200 

NL2 TRRLQRVTDEVRDEYLVFCPREPRLAFPEENNSSLKKIPKLEDPAQYSML 250 

— — 250 

NL2 ELKPRR ADLDMNQHVNN VTY I G WVLES I POE 1 1 DTHELQV ITLDYRRECQ 300 
300 

NL2 QDDIVDSLTTS EIPDDPISKLTGTNGSATSSIOGHNESQFLHMLRLSENG 350 
F M 350 

NL2 QEINRGRTQ WRKKSSR 3 65 

355 



Fig. 4 
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RAPE THIOESTERA SE PEPTIDE SEQUENCES 



NDOPROTETN ASE LYS-C 



PEPTIDE 26 



FI VRS YEVGI NK 



TRYPSIN 



PEPTIDE 15 



PEPTIDE 30 



PEPTIDE 34 



L S G D 



G Q E I N R 



LAFPE ENN 



VMMNQ DTR 



R 



PEPTIDE 39 



D W I L R 



Fig. 5 
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